Automatic Transcription of Meetings Using Topic-oriented Language Model Adaptation

نویسندگان

  • Yuya AKITA
  • Carlos TRONCOSO
  • Tatsuya KAWAHARA
چکیده

This paper presents an automatic speech recognition (ASR) system dedicated for meetings of the National Congress of Japan. The distinctive features of the congressional meeting speech are wide distribution and frequent change of topics. For more accurate transcription, such topics should be emphasized in a language model one after another. Therefore, we introduce two approaches for topic adaptation: PLSA-based approach and trigger-based approach. The PLSA-based adaptation is performed turn by turn to emphasize topics in individual pair of a question and answer. Since topics are treated in a probabilistic manner, robust adaptation is realized. On the other hand, the trigger-based adaptation stresses relevant words to the word history, thus long-distance context can be reflected into a language model. These two approaches were evaluated on real meetings of the Congress, and significant improvement of perplexity was obtained by both approaches. We also compared their effects on reduction of word error rates.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Trigger-based language model adaptation for automatic meeting transcription

We present a novel trigger-based language model adaptation method oriented to the transcription of meetings. In meetings, the topic is focused and consistent throughout the whole session, therefore keywords can be correlated over long distances. The trigger-based language model is designed to capture such long-distance dependencies, but it is typically constructed from a large corpus, which is ...

متن کامل

Language Model Adaptation Based on PLSA of Topics and Speakers for Automatic Transcription of Panel Discussions

Appropriate language modeling is one of the major issues for automatic transcription of spontaneous speech. We propose an adaptation method for statistical language models based on both topic and speaker characteristics. This approach is applied for automatic transcription of meetings and panel discussions, in which multiple participants speak on a given topic in their own speaking style. A bas...

متن کامل

Unsupervised language model adaptation based on topic and role information in multiparty meetings

We continue our previous work on the modeling of topic and role information from multiparty meetings using a hierarchical Dirichlet process (HDP), in the context of language model adaptation. In this paper we focus on three problems: 1) an empirical analysis of the HDP as a nonparametric topic model; 2) the mismatch problem of vocabularies of the baseline n-gram model and the HDP; and 3) an aut...

متن کامل

Implicitly Supervised Language Model Adaptation for Meeting Transcription

We describe the use of meeting metadata, acquired using a computerized meeting organization and note-taking system, to improve automatic transcription of meetings. By applying a two-step language model adaptation process based on notes and agenda items, we were able to reduce perplexity by 9% and word error rate by 4% relative on a set of ten meetings recorded in-house. This approach can be use...

متن کامل

Language model adaptation based on PLSA of topics and speakers

We address an adaptation method of statistical language models to topics and speaker characteristics for automatic transcription of meetings and discussions. A baseline language model is a mixture of two models, which are trained with different corpora covering various topics and speakers, respectively. Then, probabilistic latent semantic analysis (PLSA) is performed on the same respective corp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006